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p^ . Abstract 

L. K. Grover's search algorithm in quantum computing gives an optimal, quadratic speedup 
in the search for a single object in a large unsorted database. In this paper, we generalize 
Grover's algorithm in a Hilbert-space framework for both continuous and discrete time cases 
ff^ , that isolates its geometrical essence to the case where more than one object satisfies the search 

criterion. 

> 

m 

l> 

o 
o 
o 

:^ 

-(— > 

> 



1. Department of Mathematics, Texas A&M University, College Station, TX 77843-3368. 
E-mail: gchen@math.tamu.edu and fulling@math.tamu.edu. 

2. Supported in part by Texas A&M University Interdisciplinary Research Grant IRI 99-22. 

3. Department of Mathematical Sciences, University of Cincinnati, Cincinnati, OH 45221- 
0025. E-mail: chenjn@email.uc.edu. 



1 Introduction 

A quantum computer (QC) is envisaged as a collection of 2-state "quantum bits", or qubits 
(e.g., spin 1/2 particles). Quantum computation does calculations on data densely coded in the 
entangled states that are the hallmark of quantum mechanics, potentially yielding unprecedented 
parallelism in computation, as P. Shor's work on factorization ||l3|, |lj| proved in 1994. Two 
years later, L. K. Grover showed that for an unsorted database with N items in storage, it 
takes an average number of 0{\/N) searches to locate a single desired object by his quantum 
search algorithm. If A^ is a very large number, this is a significant quadratic speedup over the 
exhaustive search algorithm in a classical computer, which requires an average number of ^^ 
searches (see Remark A.l in Appendix). Even though Grover's algorithm is not exponentially 
fast (as Shor's is), it has been argued that the wide range of its applicability compensates for 
this H|. Furthermore, the quantum speedup of the search algorithm is indisputable, whereas 
for factoring the nonexistence of competitively fast classical algorithms has not yet been proved 

SI- 

Grover's original papers H, M deal with search for a single object. In practical applications, 
typically more than one item will satisfy the criterion used for searching. In the simplest 
generalization of Grover's algorithm, the number of "good" items is known in advance (and 
greater than 1). Here we expound this generalization, along the lines of a treatment of the 
single-object case by Farhi and Gutmann [pi that makes the Hilbert-space geometry of the 
situation very clear. 

The success of Grover's algorithm and its multiobject generalization is attributable to two 
main sources: 

(i) the notion of amplitude amplication; and 

(ii) the dramatic reduction to invariant subspaces of low dimension for the unitary operators 
involved. 

Indeed, the second of these can be said to be responsible for the first: A proper geometrical 
formulation of the process shows that all the "action" takes place within a two-dimensional, 
real subspace of the Hilbert space of quantum states. Since the state vectors are normalized, 
the state is confined to a one-dimensional unit circle and (if moved at all) initially has nowhere 
to go except toward the place where the amplitude for the sought-for state is maximized. This 
accounts for the robustness of Grover's algorithm — that is, the fact that Grover's original 
choice of initial state and of the Walsh-Hadamard transformation can be replaced by (almost) 
any initial state and (almost) any unitary transformation H, nO, m. 

The notion of amplitude amplification was emphasized in the original works [[7|, |g, |9| of Grover 
himself and in those of Boyer, Brassard, Il0yer and Tapp M and Brassard, H0yer and Tapp B. 



(See also |H, 0].) Dimensional reduction is prominent in the papers by Farhi and Gutmann 
P] and Jozsa [10|. We applied dimensional reduction to multiobject search independently of 
references jsl and H and later learned that the same conclusions about multiobject search (and 
more), in the discrete time case, had been obtained there in a different framework. 

The rest of the paper is divided into three parts. In §2, we present the continuous- time version 
of the multiobject search algorithm, and in §3 the discrete-time version. In the Appendix, the 
computational complexity of the classical random multiobject search algorithm is analyzed and 
some relevant points on the literature are also made. 

2 Continuous Time Quantum Computing Algorithm for 
Multiobject Search 

Farhi and Gutmann [|6| first considered quantum computation from a different point of view 
by regarding it as controlled Hamiltonian (continuous) time evolution of a system. This view is 
definitely proper, because quantum mechanical systems naturally evolve continuously in time. 
We inherit their point of view in this section. 

Let an unsorted database consist of N objects {wj | 1 < j < N}, and let / be an oracle (or 
Boolean) function such that 

/(-^)-{;; ^:J+M+2,...,iv. (2.1) 

Here the i elements {wj \ 1 < j < i} are the desired objects of search. However, in general i is 
not explicitly given. Note that the assignment of the "desired" objects to the first i values of 
the index j is just a convention for the purposes of theoretical discussion; from the point of view 
of the user of the algorithm, the N objects are in random or unknown order, or perhaps better, 
have no meaningful ordering whatever. Consider now a Hilbert space Ti. of dimension N with an 
orthonormal basis .B = {|wj) | 1 < j < N}; each \wj) is an eigenstate in the quantum computer 
representing the object Wj in the database. Denote L — span{|it;j) | 1 < j < ^}. Here we 
have adopted the notation of kct |-) and bra (-j in mathematical physics to denote, respectively, 
vectors and linear functionals in Ti. Suppose we are given a Hamiltonian H in Ti and we are 
told that H has an eigenvalue £' 7^ on the entire subspace L defined above and all the other 
eigenvalues are zero. The task is to find an eigenvector \wj) in L that has eigenvalue E. The 
task is regarded as completed when a measurement of the system shows that it is in the state 
\wj) for some j : I < j < £■ 

Define a linear operator H^, whose action on a basis element is given by 

^LK-) = f(K>-(-i)^^'"^'K», J = 1,2,..., TV. (2.2) 

Note that here we have only utilized the knowledge of /; no knowledge of the desired search 
objects {\wj) I j ~ 1,2, .. . ,£} is required or utilized since it is assumed to be hidden in the 



oracle (black box). Nevertheless, since H^ is a linear operator, through linear extension we know 
that Hl is uniquely defined, and it necessarily has the following unique "explicit" representation 






Wo 



(2.3) 



The explicitness of H^ in (2.3) is somewhat misleading. We need to emphasize that £ in 



(2.3) is not explicitly known or given since the only knowledge we have is / in (2.1). For the 
implementation of the algorithms here as well as in the next section, this does not constitute any 
problem, however, except that in most applications the determination of and the information 
about £ are important. This becomes a separate class of problems calling counting that is 
studied in M, which we hope to expound further in a sequel. As in H, we now add to H^ the 
"driving Hamiltonian" 

Hd = E\s){s\ (2.4) 

for some (yet arbitrary) unit vector \s) G H, \s) ^ L. This gives the overall Hamiltonian as 

H^Hl+Hd- (2.5) 

Our quantum computer is governed by the Schrodinger equation 



z^|V(i))=i?|^(i)), t>0, 
1^(0)) = N), 



(2.6) 

as a continuous- time, controlled Hamiltonian system. Since ( |2.6D i is autonomous, the state of 
the system at time t is given by 



|V'(i))=e-^^*|s), i>0, 

where e~*^* is the exponential N x N (time evolution) matrix. 
Define an augmented space L from L and \s): 

L = span(iU{|s)}). 



(2.7) 



(2.8) 



Let L-^ be the orthogonal complement of L satisfying L L-^ — H, where © denotes the 
orthogonal direct sum. With respect to this orthogonal decomposition, wc now have our first 
reduction of dimensionality below. 



Proposition 2.1. Fix \s) £ Ti, \s) ^ L in (2.6). Let H,L and L be defined as above. For 
any \w) € "H, write \w) — \v) + \u) E L Q) L^ according to the orthogonal direct sum. Then 
H\w) = H\v) and H\u) = 0. Consequently, the Hamiltonian H has an associated blockwise 
decomposition 



Hj 



H = 



0l2 
O22J 



(2.9) 



where Hj^ is an invertible (^ + 1) x (i + 1) matrix defined on L such that Hj^\v) =^ H\v) for all 
\v) £ L, and O12, O21 and O22 are, respectively, {£ + 1) x {N - £ — 1), {N ~ £ ~ I) x {£ + 1) and 
(N — £ — 1) X [N — £ — 1) zero matrices. 



Proof. Straightforward verification. 



D 



Corollary 2.2. Fix \s) E 7i, |s) ^ L in (2.6). Let H,L an L^ be defined as above. Then the 



state of the solution of (2J.) at time t, \'ip{t)), has zero component in L for all t > 0. 



Proof. The action of the evolution dynamics e~*^' on the invariant subspace L^ is, by ( |2.9| ), 
g"j022t — gO — Ii±, the identity operator on L-^. Since the component of \s) in L^ is the zero 
vector, the action of I^^i^i (the (A^ — ^ — 1) x [N — £ — 1) identity matrix) on it remains zero 
for all t > 0. D 



By the properties obtained above, we need to fix our attention only on Hj defined on L. By 



abuse of notation, we still write H instead of Hj^ on L. 



Proposition 2.3 (Matrix representation oi H on L). Under the same assumptions as in 



Prop. 2.1, define Xi — {s\wi) for i — 1,2, . . . ,£. Let 



i'-) = ^(i^)-E^^K)) 



O7- — 



\ 



i-Ei 



(2.10) 



Then {{wi), . . . , \w£), \r)} forms an orthonormal basis of L with respect to which H admits the 
matrix representation 






H = i^[i7,,](,+i),(,+i); H,, = <; (^^^^%;+ 5,,+ixOa, ^^£+lorJ^£+l,^^J, 



Proof. Solve ( pIo|) for \s): 



\s) = ^a;.,|mj) + C,. 



Substituting ( 2.12 ) into (2^) and (2^), we obtain ( 2.11 ) in bra-ket form. 



(2.11) 
(2.12) 

n 



The exponential matrix function e *^* on L based upon the representation can be obtained, 
e-g-, by 

«2 .fc 



-iRt 



= EfeH^)* 



fc=0 



(2.13) 



or 



iHt 



1 



— <^iCh+i+iH)-'e^'dC 



(2.14) 



where C is a simple closed curve in C enclosing all the values C such that the (^ + 1) x {£ + 1) 
matrix C-'^^+i + ^H is not invertible. However, since £ can be arbitrarily large, it is a highly 
nontrivial task (if possible at all) to calculate e~*^* explicitly based on (2.13), ( ^.14 ) or any 
other known techniques. 

It turns out that the above difficulty can be bypassed if \s) is chosen in an ingenious way 
that can effect another reduction of dimensionality. (Nevertheless, we again call attention to 
the fact that any choice of \s) must be independent of any knowledge of \wi), for z = 1, 2, . . . , i.) 
The choice by Grover 0, |l is 

N 



VA^S 



Wj 



(2.15) 



implementable and obtainable on the QC through an application of the Walsh-Hadamard trans- 
formation on all the qubits; \s) is a superposition state with the same amplitude in all eigenstates. 
With this choice of Is), we now have 



Xi = x=l/VN,i = l,2,... J; C^ = l-{i/N), \r) 



1 



V^-ii/N) 



1 '^ 



(2.16) 



in ( p.llD and ( p.lOl ). 

Theorem 2.4 (A tv^ro-dimensional invariant subspace for the Hamiltonian H). Let\s) 



be given as in (2.15). Denote 



V = {\v)eL \v) ^aJ2H) + b\r); a,beC}. 



(2.17) 



Then V is a invariant two-dimensional subspace of H in L such that 

(1) r, s, e V; 

(2) H{V) = V. 



Proof. It is obvious that (1) holds. To see (2), we have 



H\v) =H 



a^ \wi) +5|r 
Y,H\w,)+bH\; 



= H«E 



j=i 



(1 + x'^)\wi) + x^ ^ \wj) + x^\-ix'^\r) 






+ 5 



:VT^^^5I l^j) + (1 " ^•^^)l' 



i=i 



^ «E 






-fea; VT^~^^ |wj) + [afeVT^~^ + ^(1 - ^a;^ 



i=i 



e\ (a + afe^ + hx\/\ - ^^2 ^ |w,) + \aix\/\ - tx^ + 6(1 - ix^)]\r) \ G V. (2.18) 



□ 



Corollary 2.5. Define 



-y 



Wi 



(2.19) 



r/ien {\w), |r)} forms an orthonormal basis for V such that with respect to this basis, H admits 
the matrix representation 



H = E 



-'_Ljy_ N 



N 



N 



(2.20) 



with 



^-rHt ^ g-iBt 



s{Eyt) — iy sm{Eyt) — \J\ — y'^i sm{Eyt) 
- \J\ — y'^i sm(Eyt) cos{Eyt) + iy sm{Eyt) 

Proof. The representation ( 2.2C ) follows easily from (^.18| ) . 
To calculate e~*^*, write 



,y=^/I/N. (2.21) 



H = E 



l+jr__ yy/l-y'^ 



= V£/N, 



analogous to q, (8)], and apply (2.14) to obtain (2.21); or apply (2.13) and the properties of 
the SU{2) generators commonly used in quantum mechanics O, (XIII. 84), p. 546]. D 



Since V is invariant under H and i7 is a self-adjoint matrix, the orthogonal complement V''" 
of V is also an invariant subspace of H . The precise action of H or e*^* on V^ does not seem 
to be describable in simple terms. However, this knowledge is not needed as we now have the 
explicit form of the solution available below. 



Corollary 2.6. The solution 'ip{t) of (2.6) given (2.15) is 

^P{t) = e-^^''{[yms{Eyt)~iiiin{Eyt)]\w) + y/ 1 - y'^ cos{Eyt)\r)} , t > 0, 



where y — y^i/N. 



Proof. Use ( p. 211 ) and ( |2.16D . 



(2.22) 



n 



Note that (2.22) has the same form as |6|, (10)]. Since \w) and \r) are unit, mutually 
orthogonal vectors, the probability of reaching the state \w) at time t is 



P(t) = sin^ (Eyt) + 2/2 cos^ (Eyt) , y = ^fljN . 



(2.23) 



At time T = tt /{2Ey), the probability is 1. By ( ^.19 ), if we make a measurement of \'ip{t)) at 
t — T, we will obtain any one of the cigcnstates |wi), i — 1,2, . . . , £, with equal probability 1/i. 
Therefore the task of search is completed (with probability 1), requiring a total time duration 



T^./i2Ey)^^^^. 



(2.24) 



Formula (2.23) manifests the notion of amplitude amplication because the amplitude of |V'(0) 



along |r), y/l — y^ cos{Eyt), is steadily decreasing in magnitude as a function of t e [0, T] 



implying that P(t) in (2.23) is increasing for t G [0, T]. 

Next, let us address the optimality of the search algorithm as given above. We assume that 
N/£ is a large number. We show that the above generalized Grover-Farhi-Gutmann algorithm 
for multiobject search is time-optimal within the order 0{y/N/£). In contrast with the classical 



random search which requires in average {N + !)/{£+ 1) searches (see ( A.6 ) in the Appendix), 
again we see that there is a quadratic speedup. 

The idea of proof follows by combining those in [gj and |^ . Let the Hamiltonian H^ be given 
as in (2.2). We wish to add a somewhat arbitrary (generally, time-dependent) driving Hamilto- 
nian Ho{t) to Hl so that the terminal state (at time T) is in L (which, after a measurement, 
becomes one of the eigenstates |wi), \u}2), . . . , \we)). Our objective is to find a lower bound on 
T. Of course, H]j{t) and T must be independent of L, since they are part of the algorithm 
prescribed for determining L. 

Let \wi) € 7i be an arbitrary initial state such that {wi\wi) = 1. Let ipL{t) denote the 



solution of the Schrodinger equation 



lj^\Mt))^iHL + HD{tmLit)), 



0<t<T, 



IV'l(O)) = l^/)) (initial condition) 

\tpL{T)) = y ctilwi) e L, (terminal condition) 



Note that ai e C for i = 1, 2, . . . ,i and 



E 



aA'^l 



(2.25) 



(2.26) 



because the evolution process is unitary at any t e (0,T]. On the other hand, let \ip{t)) evolve 
with Hoit): 



i-Mt))^HD{t)\m), 0<t<T, 
at 

1^(0)) = \wi). 



(2.27) 



Lemma 2.7. Assume that N = N/£ is an integer. Let the orthonormal basis B = {\'uii) \ 1 < 
i < N} he grouped into N disjoint subsets 

N 



^-u.,^^^ 



(2.28) 



where each Bk = {{wi^k) I * = Ij 2, . . . , £} contains exactly £ orthonormal basis elements. Then 
we have 



N 



2N~2Vn < Y.{i>LAT) - ^{T)\^lAT) - V'(r)) 



fe=l 
< 2E^/N f. 



(2.29) 



where {ipLkit)) ** the solution |'0L(t)) of (2.2L) with L — Lk — span Bk in both the differential 
equation and the terminal condition. Consequently, 



T> (1-e 



-N 



N' 



E 



E 



(2.30) 



where e^ — N ^^^ -^ as N -^ oo. 



Proof As in M, (1.8)], we have 



(^lJT) - V(T)|Vl,(T) - ^{T)) = 2 - Y,i{^^,kW,M^{T)) + (^(r)|«,,feu;,,fe)], (2.31) 



and, therefore 

N 



i=l 



N 



Y,{^L,{T)-^{T)\^LAT)-i^{T)) = 2N~Yl 



fe=i 



k=l 



i=l 



i=l 



Let Pi^ : H ^ Lk be the orthogonal projection of H onto L^- Then 



N 



N 



< 



J2\PLMiT))\\'<l, 



k=l 1=1 fc=l 

from which, we apply the Cauchy-Schwarz inequality and obtain 

N I e. 

fc=l 1=1 i=l 



< 2iV 



1/2 



(2.32) 



(2.33) 



Combining (2.31) and (2.33), we have established the left half of the inequality (2.29) 
Next, mimicking M, (19)-(21)], we have 

|[(v^L. w - m\^L,it) - m)] = 2 iMi'LMiHLjm) 



N 



<2|(V'L,(t)|iJiJ^(i))| 

<2\\HLMt))\\=2E 
= 2E\¥Ljm)l 

N 



ElK'^IV^W)!' 



1/2 



dt 



^[(v-iji) - mi^L^t) - ^Km < 2Ej2WLjHt))i 



fc=i 



N 



fe=l 



and from ^ |Pij,|i/)(i))| = 1, by an application of the Cauchy-Schwarz inequality again, we 

fc=i 
obtain 



d 



N 



-T, EK^i. W - ^WI^L. W - V'(0>] < -^EN^ 



/2 



dt 



(2.34) 



fc=i 



Integrating (2.34) from to T, noting that \i})l^{Q) — "0(0)) = 0, we have verified the right half 



of inequality (2.29) 



D 



In general, N/(. is not necessarily an integer. Therefore, the disjoint union (2.28) is not 

always possible. Define N — [N/P\, namely, the integral part of the rational number N/L Then 

N 

— =iV + (5, where Q<5<1. 



Then we can rewrite ( 2.28 ) as 



• N 



(2.35) 



.N 



where lJfe=i-^'= ^^ ^'^ arbitrary collection of disjoint sets of t orthonormal basis elements con- 
taining a total of N(. of them, and R is the remaining set of orthonormal basis elements with 
cardinality (.5{< £). The proof of Lemma 2.7 extends to this case except for some tedious de- 
tails of bookkeeping concerning the short set R, which we omit. We therefore have arrived at 
the following order of time-optimality for the continuous-time generalized Grover algorithm for 
multiobject search. 



10 



Theorem 2.8. Assume that N/£ is large. Then it requires at least 



^-^M, e^^,>0, e^,,^0{{N/i)-^f% 



T — 

E \ £' 



time duration in average for the driven general Hamiltonian system (2.2V) to reach the subspace 



L. n 

3 Discrete Time Case: Straightforward Generalization of 
Grover's Algorithm to Multiobject Search 

In this section we generalize Grover's search algorithm in its original form |Q, |[ to the 
situation where the number of objects satisfying the search criterion is greater than 1. We 
are considering the discrete time case here, which may be regarded as a discrete-time sampled 
system of the continuous-time case treated in the preceding section. Unlike the continuous- 
time case, there have been a relatively rich literature studying the generalization of Grover's 
discrete-time algorithm to multiobject search (see |B, 0, 0, H] and the references therein). Our 
presentation below gives more formalized arguments than those earlier contributions, provides 
a clearer Hilbert space framework, and settles a relevant question contained in l3| . 

Let the database {wi \ i = 1,2, . . . , N}, orthonormal eigenstates {\wi) | i = 1, 2, . . . , N} and 
the oracle function / be the same as given at the beginning of §2. The definitions of H, L and 
£ remain the same. 

Define a linear operation in terms of the oracle function / as follows: 

JlK-) = (-l)/(-^V,), J = 1,2,..., TV. (3.1) 

Then since II is linear, the extension of II to the entire space Ti. is unique, with an "explicit" 

representation 

e 
Il=I-2Y,H){w,\, (3.2) 

where / is the identity operator on TL. I^ is the operator of rotation (by t:) of the phase of the 
subspace L. Note again that the explicitness of (p^) is misleading because explicit knowledge of 
{\wj) \ I < j < £} and £ in (|3.2|) is not available. Nevertheless, (p^) is a well-defined (unitary) 



operator on Ti. because of (3.1). (Unitarity is a requirement for all operations in a QC.) 



We now define \s) as in (2.15|). Then 



1 ^ 1 ^ IM-£ . , 

^EK) = ^f7EK) + V^^I^); seeOforlr). (3.3) 



t—1 ^ 2—1 



Now, define another operator, the inversion about average operation, jnst as in Grover [|7|, ^: 

/«=7-2|s)(s|. (3.4) 



11 



Note that Is in (3.4) is unitary and hence quantmn mechanicaUy admissible. Ig is explicitly 
known, constructible with the so-caUed Walsh-Hadamard transformation. 



Lemma 3.1. Let L be defined as in (2.8). Then {{wi), \r) | i = 1,2, . . . ,£} forms an orthonor- 
mal basis of L. The orthogonal direct sum Ti = L(BL^ is an orthogonal invariant decomposition 
for both operators Ij and Ig . Furthermore, 

(i) The restriction of Ig on L admits a unitary matrix representation with respect to the 
orthonormal basis {\wi), \w2), ■ . ■ , \wi), \r)}: 



A — \aif\{i+\)y.{i+\)i 



uJ(^+l)x(£+l), 
A -1 



l<i,J<^, 



aj7 = < 



2^/iv"^ 



N 



N 
1, 



[S^^i+i + Sj^i+i), i = £+1 or j = £+l,i^ j, 

i^ j = e+1. 



(3.5) 



(ii) The restriction oflg of L^ isPj^j^, the orthogonal projection operator on L-^ . Consequently, 
Is\j± = Ij±) where Ij± is the identity operator on L^ . 



Proof. We have, from (3.3) and (3.4), 



Ig=I-2 



1 ^ 



N -t. 



N 



1 ^ 



N ~£ 



N 



Ek,)(u;,) + |r)(r|+P£_ 



- 4eek)(-.i 



N 



J2{\w,){r\ + \r){w,\) 



EE(^^ 

i=i ]=i 

21 



,4=1 

2 



'N ~£ 



N 



\r){r\ 



y - ^ / iWi){wji 



2y/W~^ 
N 



^(K)(r| + |r)(^,|) 



+ {j^-^]\r){r\+^Z. 



The conclusion follows. 



(3.6) 
D 



The generalized "Grover's search engine" for multiobject search is now defined as 



U = -IglL. 



(3.7) 



Lemma 3.2. The orthogonal direct sum Ti — L (B L-^ is an invariant decomposition for the 
unitary operator U, such that the following holds: 



12 



(1) With respect to the orthonormal basis {\wi), . . . , {wg), \r)} of L, U admits a unitary matrix 
representation 



U\l = 



''V\{i+l)x(l+l), 



jr 



i<i,j<e, 



2VN^ 

TV 
2i 

TV' 



(^j,<'+i - Si^i+i), i = £+l or j = £+l,i^ j, 
i= j = N + 1. 



(3.8) 



(2) The restriction of U on L^ is 



Proof. Substituting (3.2) and (3.6) into (3.7) and simplifying, we obtain 
U = —IsIl = ■ ■ ■ (simplification) 



i=l j = l ^ ^ i=l 

2£ 



+ {^-J^]\r){r\-Vl.- 



The proof follows. 



D 



Lemmas 3.1 and 3.2 above effect a reduction of the problem to an invariant subspace L, just 
as Prop. EJ did. However, L is an {£ + l)-dimcnsional subspace where £ may also be large. 
Another reduction of dimensionality is needed to further simplify the operator U. 



Proposition 3.3. Define V as in (2AT). Then V is an invariant two-dimensional subspace of 
U such that 

(1) r,se V; 

(2) U{V) = V. 

Proof. Straightforward verification. D 



Let \w) be defined as in (2.19). Then as in §2, {|w), \r)} forms an orthonormal basis of V. 
We have the second reduction, to dimensionality 2. 

Theorem 3.4. With respect to the orthonormal basis {\w), |r)} in the invariant subspace V,U 
admits the unitary matrix representation 



U = 



N-21 2V£(JV~£) 

N N 

2^1{N-IL) jV-2£ 

TV N 



COS a sm t 
— sin 9 cos ( 



= sm 



_J2^£{N~£) 



N 



Proof. Use the matrix representation (3.8) and (2.19) 



(3.9) 



n 
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Since \s) S V, we can calculate [/™|s) efficiently using 

[/""Is) = U'' 



(^S"-> 



N -£. 



N 



(by M) 



t/"^ 



^ 



N 



cos t/ — sm ( 
sin 6 cos 6* 

cos{m6 + a) 
sm{m9 + a)] 



N -£ 
N ' 
- _j_ 



N 



a = cos ^ 



(3.10) 



= cos{m9 + a) ■ \w) + sin{m9 + a) ■ \r). 
Thus, the probability of reaching the state \w) after m iterations is 

Pjn = cos^ {m9 + a); 



(3.11) 



cf. ( ^.23 ) in the continuous-time case. If £ <C iV, then a is close to 7r/2 and, therefore, (3.11) is 
an increasing function of m initially. This again manifests the notion of amplitude amplification. 
This probability Pm is maximized if m9 + a = tt, implying 



TT ~ a 



the integral part of 



TT — a 



When i/N is small, we have 



= sm 



_i 2^£iN-£) 



N 



sin^i 2 



"^" 27V ~ 8 V^ ^"' 



2xl-+Oii£/Nf^% 



_i £ _n 

~^^2 



-^+0{{£/N^'^f) 



Therefore 



--{f 



Vlv 



O 



((ViVl/2)3)j| 



2Ji+0{{l/Nf/^) 



IT 


N 


" 


( Ml 


-t\ 


1 — 


l + O 




4 \ 


1 £ 




V^JJ 



(3.12) 



Corollary 3.5. The generalized Graver's algorithm for multiobject search with operator U given 
by (S.l) has success probability P„i = cos^{m9 + a) of reaching the state \w) G L after m 



iterations. For £/N small, after m — jy^N/£ iterations, the probability of reaching \w) is close 
to 1. D 
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The result (3.12) is consistent with Grover's original algorithm for single object search with 



^ = 1, which has TO w l^/N. 

Theorem 3.6. (Boyer, Brassard, H0yer and Tapp P]). Assume that i/N is small. Then 
any search algorithm for i objects, in the form, of 

UpUp-i .. .Ui\wi), 

where each Uj,j = 1,2, . . . ,p, is a unitary operator and \wj) is an arbitrary combination state, 



takes in average p = 0{\/ N / 1) iterations in order to reach the subspace L with a positive 
probability P independent of N and i. Therefore, the generalized Grover algorithm in Cor. \3.^ 
is of optimal order. 

Proof. This is the major theorem in [g[; see Section 7 and particularly Theorem 8 therein. Note 
also the work by C. Zalka who considered some measurement effects in |l5|. D 



Unfortunately, if the number £ of good items is not known in advance. Corollary 3.5 does 
not tell us when to stop the iteration. This problem was addressed in B, and in another way 
in M. In a related context an equation arose that was not fully solved in |3|. We consider it 
in the final segment of this section. As in [pi §3], consider stopping the Grover process after 
j iterations, and, if a good object is not obtained, starting it over again from the beginning. 



From Corollary 3.5 , the probability of success after j iterations is cos^ {j0 — a) . By a well-known 
theorem of probability theory, if the probability of success in one "trial" is p, then the expected 
number of trials before success is achieved will be 1/p. (The probability that success is achieved 
on the fcth trial is p{l ~ p)''^^. Therefore, the expected number of trials is 

Y^kp{i-pr^ = -pY^^ii-p)'^ = -p-^'-^ , (3.13) 

k=l k=l ^ P P 

which is 1/p.) In our case, each trial consists of j Grover iterations, so the expected number of 
iterations before success is 

E{j)=j-aec^{je-a) . 

The optimal number of iterations j is obtained by setting the derivative E'{j) equal to zero: 

= E'{j) = sec^(j6' -a)+ 2jesec^(j0 - a) ta,n{je - a), 

2j9 = -cot{{j9-a)). (3.14) 

(In 1^, §3], this equation is derived in the form 4-dj = tan((2j + !)'&), which is seen to be 



equivalent to ( 3.14 ) by noting that iS — ^ = ^ — a. Those authors then note that they have not 



solved the equation Mj = tan((2j + !)■&) but proceed to use an ad hoc equation z = tan(z/2) 
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with z = A-dj instead.) Let us now approximate the solution j of (3.14) iteratively as foUows 



From (3.14), 



2j9 s[n{j0 - a) + cos{j0 - a) = , 
g2»(9,-a) ^ (^20^- + l)/{t29j - 1) , 

and by taking the logarithm of both sides, we obtain 

'i2ej + l 



2i{6j — a) ~ iiim + i arg 



126 j - 1 



In 



i2ej + 1 



i2ej - 1 



(3.15) 



(3.16) 



for any integer n. Assume that i/N is small so that j is large, but we are looking for the smallest 



such positive j. Note that the logarithmic term in ( 3.16 ) vanishes, and 



arg - 



i29j + 1 
i2ej - 1 



= —2 tan 



2dj 



= 2 



by taking n = in (3.16), we obtain 



1 
2ie 
1 " 



y. (-i)'?+i / 1 y^+^' 



^3 



-'^ + om)-^) 



The first order approximation ji for j is obtained by solving 

ii-^(a+v/a2-2) 



(3.17) 



20ji 

1 
29- 



(3.18) 



Higher order approximations jn+i for Jt- = 1,2,... , may be obtained by successive iterations 



jn+i = - I a - tan 



1 



2ej„ 



based on ( |3.14 ). This process will yield a convergent solution j to (3.14) 



Appendix: Random Mult i- Object Search 

Given a set of N unsorted objects, among which £ of them are the desired objects that we 
are searching for, how many times in average do we need to search in order to obtain the first 
desired object? 
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Because the N objects are unsorted, the above problem is equivalent to a familiar problem 
in probability theory: 

"An urn contains N balls, £ of them are black and the rest are white. Each time 
draw a ball randomly without replacement. Determine the number of times in 
average needed in order to draw the first black ball". (A.l) 



A different version of random multi-object search would correspond to (A.l) but with repl 



ment after each drawing. This search method is less efficient than (A.l), but can be treated by 
a similar approach as given below and, thus, will be omitted. 



Even though we believe the solution to (A.l) is available in the literature, we could not 
locate a precise citation and, therefore, feel the urge to write this Appendix. 
We define a random variable 

Tf, = number of drawings needed to draw a ball randomly 

and without replacement until the first black ball is out. (A. 2) 

Our objective is to calculate E{Ti,), the expected or the average value of Ti,. 
Obviously, 

Tbe{l,2,... ,N-i+l}. (A.3) 

We use (") to denote the combinatorial coefficient nl/[j\{n — j)l], and use P{A) to denote the 
probability of a given event A (measurable with respect to the random variable Tf,). 

Proposition A.l. For j e {1, 2, . . . ,N - £+ I}, 



N 



^m = .') = M^«ToTI)- <*-^' 



Proof. By the very definition of T^, in ( [A.2| ) , we know that 

P{Tii = j) = P(the first j — I drawings result in j — 1 white balls, but 
the j-ih drawing results in a black ball). 



Therefore (A.4) follows. □ 



Proposition A. 2. (IT^, p. 54, (10), (11)]) For any m,n € {0, 1, 2, ... } and m < n, 

^-^ \ m ) ^-^ \m) Vto+I 

fc=0 ^ ^ j=m ^ ^ ^ 
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Proof. By Pascal's formula, 



we have 



J + 1 A / J 

771+1/ V"^ + 1 



(A.5) 



0=m 



Z-^ \ m I Z-^ 



m + 1 1 \?77,+ l 



J + l\ '^ fj + l 



■^ Vm+ 1/ . ^ 



771+1 



J— m ^ ' J— m — 1 

'r7 + l\ / ni \ / n + I 

,777 + 1/ Vt/I + 1/ ^ V 777 + 1 



D 



Theorem A. 3. 



E{n) 



N+l 

e + 1 ■ 



(A.6) 



Proof. From ( A. 4 ), we have 



W-f+l 



E{n)^ ^ J 






{N-ey. 



{j~iy.{N-e-j+iy. 



1 (,■ i) • [^ - 0- - 1)] 



£(iV- 


iy. 


iV 




£{N^ 


£)\ 



•3 



i=i 



U-mN-j+iy -(N-j + l) 



Eo + 1 



J=0 



{N-e~jy. 



m 



J2{N^i+l^k) 



k=0 



{k + e-iy 
ki 



(where k = N - £ - j) 



{N- 


-£ + 1) 


{N- 


-£ + 1) 


{N^ 


^£ + 1) 



£{N-£y'^ {k + £-iy 



m ^^ k\ 

k=0 



i{N-£y'^ {k + £-iy 



m 



tl (^-1)! 



'n-i {k+i-iy 

Ek\{e-iy 
N\ 

fe=o i\{N-iy 
_j_^'^k + £-l 



N-{e+i) (k+ey 
y^ kw. 



N\ 



fe=o e\{N-ey 



£ 



1 " ^'' l'k + £ 
\eJ k=o 



Now, applying Proposition A. 2, we obtain 

£-1 



N \ ^ fk + £-l 
-ij+S 

^ fe=0 



iV 



iv\ ""-^''fk- 



E 

*:=0 



N -£ fN 



N 



£ + 1 £ + 1 \£ 



Substituting (A.S) and (A. 9) into (A. 7), we obtain 



(A.7) 



(A.8) 
(A.9) 



E{Ti,) = • • • (continuing from (|A.7|)) 



<--')i':)i.^o--<"--"-fT/^^' 



D 
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Remark A.l. When £ = 1, by (A. 6) it takes (A^+ l)/2 searches in average to obtain the desired 
single object. In the literature, this is usually cited as iV/2, which of course differs negligibly 
for large N . 



For iV = 4 and £ = 1, by (A. 6) it takes (4 + 1)/(1 + 1) = 2.5 times of search on average 
to obtain the desired item. But in 0, p. 3408], it is stated that it takes 9/4 — 2.25 times of 
search on average. The reason for the discrepancy is a different definition of successful search. 
The authors of regard the search as completed as soon as the location of the desired item is 
known, even if that item has not been physically "drawn from the urn" . They therefore count 
the worst case, where the desired item is the last one drawn, as requiring only 3 steps instead of 
4. This redefinition could be incorporated into our theorem at the expense of some complication; 
but it seems to us to be the less natural convention in the scenario of multiple desired objects 
only one of which is required to be "produced" . 

n 
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